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Abstract 

This paper studies fitness inheritance as an efficiency enhancement technique for a class of 
competent genetic algorithms called estimation distribution algorithms. Probabilistic models of 
important sub-solutions are developed to estimate the fitness of a proportion of individuals in 
the population, thereby avoiding computationally expensive function evaluations. The effect of 
fitness inheritance on the convergence time and population sizing are modeled and the speed-up 
obtained through inheritance is predicted. The results show that a fitness- inheritance mechanism 
which utilizes information on building-block fitnesses provides significant efficiency enhancement. 
For additively separable problems, fitness inheritance reduces the number of function evaluations 
to about half and yields a speed-up of about 1.75-2.25. 



1 Introduction 



Since the inception of genetic and evolutionary algorithms (GEAs), significant advances have been 
made in the theory, design and application to complex real-world problems. A decomposition 
methodology has been proposed for a successful design of GEAs ( Goldb eFg, 1991} Goldberg, Deb, &; Clark, 1 992 ; 
Gold berg, 2002| ). Based on the design-decomposition theory several competent GEAs — genetic 
algorithms (GAs) that solve hard problems quickly, reliably, and accurately — have been pro- 
posed (Goldberg, 2002). Competent GEAs successfully solve boundedly difficult problems, of- 
tentimes requiring polynomial — in terms of problem size — number of function evaluations. In 
essence, competent GEAs take problems that were intractable with first generation GAs, and 



1 



render them tractable. One such class of competent GAs is the probabilistic model-building GAs 
(PMBGAs) ( [Pelikan, Lobo, Goldberg, 20021 [Larranaga Lozano, 20021 |Pelikan, 2002D , wherein 
a traditional crossover is replaced by a two step process: (1) Building of probabilistic mod- 
els that identify important sub-solutions (or building blocks (BBs)). (2) Sampling the proba- 
bilistic models to efficiently mix the sub-solutions to create the offspring population. PMBGAs 
have solved problems of bounded difficulty (both regular as well as hierarchical) requiring only 
quadratic number of function evaluations ( Pelik an"72002[ |Pelikan, Goldberg, &: Gantu-Paz, "20 00b 
Pelikan fc Goldberg, 200"Tj |Pelikan, Sastry, Goldberg, 2003) . 



However, for large-scale problems, especially if the fitness function is a complex simulation, 
model, or computation, the task of computing even subquadratic number of evaluations can be 
daunting. This places a premium on a variety of efficiency-enhancement techniques (EETs). 
In essence, while competence leads us from intractability to tractability, efficiency enhancement 
takes us from tractability to practicality. Various EETs can be broadly classified into four broad 
classes (Goldberg, 2002): parallelization, hybridization, time-utilization and evaluation relaxation. 
In evaluation-relaxation schemes the computationally expensive, but accurate, fitness function is 
replaced by a cheap, but approximate, fitness function, there by speeding-up the GA process. 
One such evaluation-relaxation technique is fitness inheritance (Sm ith, Dike, &; "S tegmann, 1995 
Sastry, Goldberg, & Pelikan, 2001). In fitness inheritance, a certain proportion of the offspring 



derive their fitness values from parental fitnesses rather than through function evaluations. 

This paper analyzes fitness inheritance in PMBGAs, specifically extended compact genetic 
algorithm (eCGA) flHarik, 19 99). In our study, the inherited fitness is derived and estimated from 
the probabilistic model constructed by eCGA. The purpose of this paper is to model such an 
inheritance mechanism and predict the scalability, speed-up and the optimal inheritance parameter 
that yields greatest speed-up. The approach used in this study is along the lines of those reported 
elsewhere for fitness inheritance in simple GAs (Sastry, Goldberg, & Pelikan, 2001). The analysis, 
in which the inherited fitness of an offspring was the average of its parental fitness values, predicted 
that using inherited fitness for 50% of the offspring population yielded a maximum speed-up of 
30%. This was in contrast to a much more significant empirical speed-up observed by Smith, Dike, 
and Stegmann (Smi th, Dike, &; Stegmann, 1995). The purpose of this paper is also to investigate if 
an inheritance mechanism that incorporate knowledge of building block (or sub-solution) fitnesses 
would provide greater speed up than a simple inheritance procedure. 

This paper is organized as follows. The next section reviews the literature on fitness inheritance 
in GAs, followed by a brief description of extended compact genetic algorithm in Section |31 We 
derive facetwise models for convergence time and population sizing required for the successful design 
of eCGA with fitness inheritance in Section |1J Subsequently, we use the convergence-time and 
population-sizing models to predict the scalability and speed-up obtained via fitness inheritance. 
Section El discusses possible directions of future research, followed by conclusions. 



2 Literature Review 

Smith, Dike and Stegmann (Smith, Dike, & Stegmann, 1995) proposed fitness inheritance in GAs. 
They proposed two ways of inheriting fitness, one by taking the average fitness and the other by 
taking a weighted average of the fitness of the two parents. Their results indicated that GAs with 
fitness inheritance outperformed those without inheritance in both the OneMax and an aircraft 
routing problem. However, they did not investigate the effect of fitness inheritance on convergence 
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time, population sizing, and scalability of GAs. Though the original study showed very encouraging 
results, unfortunately there have been very few follow up studies on fitness inheritance. Zheng, 
Julstrom, and Cheng (Zheng, Julstr om, Cheng, 1997 ) used fitness inheritance for the design of 
vector quantization codebooks. 

Sastry, Goldberg, and Pelikan ( Sastry, G oldberg, &: Pclikan, 2001) provided theoretical anal- 
ysis of fitness inheritance in simple GAs, in which facetwise models for convergence time and 
population sizing were derived and utilized for predicting the scalability and speed-up obtained by 
using fitness inheritance. The analysis predicted that an optimal speed-up of 30% can be achieved 
when 50% of the offspring population received inherited fitness. Chen, Goldberg, Ho, and Sastry 
( Chen, Goldberg, Ho, Sastry, 2002 ) extended the analysis of fitness inheritance to multiobjective 
problems and predicted that a speed-up of 25% can be achieved when 50-55% of the offspring popu- 
lation received inherited fitness. Ducheyne, De Baets, and De Wulf (Ducheyne, De Baets, & De Wulf, 
studied the utility of fitness inheritance in convex and non-convex multiobjective problems. Pe- 
likan and Sastry (Pelikan fc Sastry, 2004 ) have proposed fitness inheritance for the efficiency- 
enhancement of Bayesian optimization algorithm. 



2003) 



3 Extended Compact Genetic Algorithm (eCGA) 



The extended compact GA proposed by Harik ( Harik, 1999 1 is based on a key idea that the choice 



of a good probability distribution is equivalent to linkage learning. The measure of a good distri- 
bution is quantified based on minimum description length (MDL) models. The key concept behind 
MDL models is that all things being equal, simpler distributions are better than more complex 
ones. The MDL restriction penalizes both inaccurate and complex models, thereby leading to an 
optimal probability distribution. Thus, MDL restriction reformulates the problem of finding a 
good distribution as an optimization problem that minimizes both the probability model as well 
as population representation. The probability distribution used in eCGA is a class of probability 
models known as marginal product models (MPMs). MPMs are formed as a product of marginal 
distributions on a partition of the genes and are similar to those of the compact GA (CGA) 
(Harik, Lobo, fc Goldberg, 1998|) and PBIL flBaluja, 19941). Unlike the models used in CGA and 



PBIL, MPMs can represent probability distributions for more than one gene at a time. MPMs also 
facilitate a direct linkage map with each partition separating tightly linked genes. For example, 
the following MPM, [1,3] [2] [4] , for a four-bit problem represents that the 1 st and 3 rd genes are 
linked and 2 nd and 4 th genes are independent. Additionally, the MPM consists of the marginal 
probabilities: {p(xi = 0,x 3 = 0), p(x\ = 0,x 3 = 1), p(x\ = l,x 3 = 0), p{x\ = l,x 3 = 1), p(x 2 = 0), 
p(x2 = 1), p(x4 = 0), p(x4 = 1)}, where Xi is the value of the i th gene. 
The eCGA can be algorithmically outlined as follows: 

1. Initialization: The population is usually initialized with random individuals. However, other 
initialization procedures can also be used. 

2. Evaluate the fitness value of the individuals 

3. Selection: The eCGA uses s-wise tournament selection ( Goldberg, Korb, Deb, 1989). How- 



ever, other selection procedures can be used instead of tournament selection. 
4. Build the probabilistic model: In eCGA, both the structure and the parameters of the model 
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are searched. A greedy search heuristic is used to find an optimal model of the selected 
individuals in the population. 

5. Create new individuals: In eCGA, new individuals are created by sampling the probabilistic 
model. 

6. Replace the parents with the offspring. 

7. Repeat steps 2-6 until some termination criteria are met. 

Two things need further explanation, one is the identification of MPM using MDL and the other 
is the creation of a new population based on MPM. The identification of MPM is formulated as a 
constrained optimization problem, 

Minimize C m + C p (1) 
Subject to 

2 k ' <n ViG [l,m\. (2) 

where C m is the model complexity which represents the cost of a complex model. In essence, the 
model complexity, C m , quantifies the model representation size in terms of number of bits required 
to store all the marginal probabilities. Let, a given problem of size I with binary alphabets, have m 
partitions with k{ genes in the i th partition, such that Y^Li hi = Then each partition i requires 
2 fc — 1 independent frequencies to completely define its marginal distribution. Furthermore, each 
frequency is of size log 2 (n), where n is the population size. Therefore, the model complexity C m , 
is given by 

m 

C m = log 2 (n)^(2^-l). (3) 

i=\ 

The compressed population complexity, C p , represents the cost of using a simple model as 
against a complex one. In essence, the compressed population complexity, C p , quantifies the data 
compression in terms of the entropy of the marginal distribution over all partitions. Therefore, C p 
is evaluated as 

m 2 k i 

C p = n^2J2 -Pij l °S2 (Pij) ( 4 ) 
i=l j=i 

where pij is the frequency of the j th gene sequence of the genes belonging to the i th partition. 
In other words, = Nij/n, where Nij is the number of chromosomes in the population (after 
selection) possessing bit-sequence j £ [l,2 fci ] 1 for i th partition. The constraint (Equation |2J) arises 
due to finite population size. 

The following greedy search heuristic is used to find an optimal or near-optimal probabilistic 
model: 

1. Assume each variable is independent of each other. The model is a vector of probabilities. 

2. Compute the model complexity and population complexity values of the current model. 

1 Note that a BB of length k has 2 h possible sequences where the first sequence denotes be 00- • -0 and the last 
sequence If - ■ -1 
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3. Consider all possible — 1) merges of two variables. 

4. Evaluate the model and compressed population complexity values for each model structure. 

5. Select the merged model with lowest combined complexity. 

6. If the combined complexity of the best merged model is better than the combined complexity 
of the model evaluated in step 2., replace it with the best merged model and go to step 2. 

7. If the combined complexity of the best merged model is less than or equal to the combined 
complexity, the model cannot be improved and the model of step 2. is the probabilistic model 
of the current generation. 

The offspring population are generated by randomly choosing subsets from the current individuals 
according to the probabilities of the subsets as calculated in the probabilistic model. 

4 Fitness Inheritance in eCGA 

The previous section outlined the key steps and mechanisms of eCGA. In this section we discuss the 
enhancements and modifications made on eCGA to enable the incorporation of fitness inheritance. 

Similar to earlier studies on fitness inheritance, all the individuals in the initial population 
are evaluated and subsequently a portion of the offspring population receives inherited fitness 
and the others receive actual fitness evaluation. That is, an offspring receives inherited fitness 
with a probability pi, or an evaluated fitness with a probability 1 — p^. However, unlike previous 
works, which used either average or weighted average of parental fitnesses as the inherited fitness, 
here we employ the probabilistic model built by eCGA and estimates of linkage-group fitnesses in 
determining the inherited fitness. Specifically, individuals from parental population that received 
evaluated fitnesses (that is, individuals whose fitnesses were not inherited) are used to determine 
the fitnesses of schemata that are defined by the probabilistic model. The schema fitness from 
different partitions are then used to estimate the fitness of an offspring. The fitness-inheritance 
procedure is detailed in the following paragraphs. 

After the probabilistic model is built and the linkage map is obtained (step 4 of the eCGA 
algorithm outlined in the previous section), we estimate the fitness of schemata using only those 
individuals whose fitnesses were not inherited. In all, we estimate the fitness of a total of YaLi 2 ki 
schemas. Considering our previous example (Section |3J) of a four-bit problem, whose model is 
[1,3] [2] [4], the schemata whose fitnesses are estimated are: {0*0*, 0*1*, 1*0*, 1*1*, *0**, 
***0, ***l}. 

The fitness of a schema, h, is defined as the difference between the average fitness of individuals 
that contain the schema and the average fitness of all the individuals. That is, 



where rih is the total number of individuals that contain the schema h, xi is the i individual and 
f(xi) is its fitness, n' is the total number of individuals that were evaluated. If a particular schema 
is not present in the population, its fitness is arbitrarily set to zero. Furthermore, it should be 




(5) 
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noted that the above definition of schema fitness is not unique and other estimates can be used. 
The key point however is the use of the probabilistic model in determining the schema fitnesses. 

Once the schema fitnesses across partitions are estimated, the offspring population is created 
as outlined in Section G3 An offspring receives inherited fitness with a probability pi , referred to as 
the inheritance probability. The inherited fitness is computed as follows: 

-. n' m 

/inh (y) = - £ / (Xi) +]T/ S (hi 6 y) (6) 

Tl . . 

1=1 1=1 

where y is the offspring individual. It should be noted that the eCGA model yields non-overlapping 
linkage groups and might not be appropriate for problems with overlapping BBs. However, sim- 
ilar concepts can be incorporated in other PMBGAs such as the Bayesian optimization algo- 
rithm (BOA) dPelikan, Goldberg, Oantu-Paz, 2000bl which can handle overlapping BBs. More- 



over, the inherited fitness can be computed by other methods, some of which are outlined in 
(Pelikan & Sastry, 2004), but the key is to use the estimates of substructure fitnesses in the com- 



putation. 

With this understanding of the inheritance mechanism, we will now model the effects of fitness 
inheritance on population sizing and convergence time in the next sections. These models are then 
used to predict the speed-up (or efficiency enhancement) obtained through fitness inheritance. 



5 Modeling Fitness Inheritance 

To ease the analytical burden, we assume a non-overlapping population of fixed size and a gen- 
erationwise eCGA. We consider binary strings of fixed length as the chromosomes. Furthermore, 
the models assume additively separable (nearly separable) uniformly-scaled problems. That is, we 
assume that the BBs are non-overlapping and the contribution of each BB to fitness is equal. Many 
of the above assumptions are made for simplifying the models and can be relaxed in a straightfor- 
ward manner. We also assume that the probabilistic model accurately map to the building blocks 
of the problem. However, the effect of a wrong model is not significant as we will see later in the 
verification of the models. 



Assuming that the actual fitness distribution, F, is Gaussian with mean fif and variance a 2 



f,f 



and that the distribution of fitness with inheritance, F' is Gaussian with mean up % and variance 



F 



1 =Af(jif>,t,o 2 f>,t) • 



The above assumptions are reasonable since crossover has a normalizing effect. 

From (Sastry, Goldberg, & Pelikan, 2001) we know that fitness inheritance can be modeled as 



an addition of external noise to the actual fitness. 

F' = F+Ar(Q, Pi al t ) (7) 

where pi is the probability of and individual receiving inherited fitness, and a\ t is variance of error 
between the inherited and actual fitnesses. Since the inherited fitness, fmh, is equal to the sum of 
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estimated building-block fitnesses, for additively separable uniformly scaled problems af t is given 
by 

m 
3=1 

= mo BB,mh> ( 8 ) 

where a BB 

inh * s ^ ne variance of the estimated fitness of a building block. Since the noise in the 
estimate of a BB comes from the other m — 1 partitions ( |Goldberg, Deb, Hi Clark, 19 92 ) , 

^BB.inh ~ i( m ~ 1)/™](TBB ~ °BB- 

where cr BB is the actual BB variance. This approximation however is not valid for very high 
inheritance-probability values as the BB fitness is estimated from very few individuals which in- 
crease the error in the estimate significantly. Empirically, we observed that o BB inh becomes sig- 
nificantly higher than the above approximation when pi > 0.85. 

Therefore, the noise variance due to fitness inheritance is given by 

°i,t = ma BB = °),t- (9) 

From the above equation and Equations [5] and [7[ we can write the fitness distribution with inheri- 
tance as 

F' = M (jif,t> (1 + Pi) a }',t) • (10) 
We now proceed to develop population-sizing and convergence-time models. 



5.1 Population Sizing 



Population sizing is one of the important factors that determine GA success. Therefore it is 
essential to appropriately size the population to incorporate the effects of fitness inheritance. Gold- 
berg, Deb, and Clark ( Goldberg, Deb, Hi Clark, 1992 1 proposed population-sizing models for cor- 
rectly deciding between competing BBs. They incorporated noise arising from other partitions 
into their model. However, they assumed that if wrong BBs were chosen in the first gener- 
ation, the GAs would be unable to recover from the error. Harik, Cantu-Paz, Goldberg, and 
Miller ( Harik, Cantu-Paz, Goldberg, &: Miller, 199 9) refined the above model by incorporating cu- 
mulative effects of decision making over time rather than in first generation only. Harik et al. 
(Har ik, Cantu-Paz, G oldberg, & Miller, 1997) modeled the decision making between the best and 
second best BBs in a partition as a gambler's ruin problem. This model is based on the as- 
sumption that the selection process used is tournament selection without replacement. Miller 
(Miller, 1997) extended the gambler's ruin model to incorporate external noise. Pelikan, Gold- 
berg, and Cantu-Paz (Pelik an^ Goldberg, k, Cantu-Paz, 2000a| ) and Pelikan, Sastry, and Goldberg 
( Pelikan, Sastry, Hi Goldberg, 20031 ) developed population-sizing models for PMBGAs, specifically 
for BOA. Sastry and Goldberg ( Sas try Hi Goldberg, 2000 ) empirically demonstrated that the pop- 
ulation sizing of eCGA is similar to that of BOA. 

The population-sizing model which incorporates the effect of model-building and its accuracy 
on the population sizing of the GA, is given by 



n 



-c n log(a)2 fc (a 2 



2 
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where n is the population size, c n is a problem-dependent constant, k is the BB length, a is the 

3 ' 



probability of failure, and a 2 f is the fitness variance, and af^ is the external noise variance. From 



Equation 1101 we can see that afj = Pier 2 . Therefore, the population-size equation can be written as 

(11) 



n 



-c n log(a)2 k aj(l+ Pi ). 

The effect of fitness inheritance on population sizing can be predicted by dividing the above 
equation by the population size required when no inheritance is used. That is, by the population- 
sizing ratio, n/n(pi = 0): 

71 

n r = —, - = (1+Pi). (12) 



n (jpi = 0) 



5.2 Time to Convergence 



Understanding convergence time is one of the key factors in the design of GAs and in predicting their 
scalability (Goldberg, 2002). Miihlenbein and Schlierkamp-Voosen ( Miihlenbein Hi Sch lierka mp-Voosen, 1 993) 
derived the convergence-time model for the breeder GA using the notion of selection intensity 
(Buhner, 1985) from population genetics. Thierens and Goldberg (Thierens Hi Goldberg, 1994) 
derived convergence-time models for different selections schemes including binary tournament se- 
lection. Back (Back, 1994) derived estimates of selection intensity for s-wise tournament and (A, /x) 
selection. Miller and Goldberg (Mi ller Hi Goldberg, 19951 ) developed convergence-time models for 
s-wise tournament selection and incorporated the effects of external noise. Back ( |Back, 1995 |) 
developed convergence-time models for (A, fi) selection. 



The assumptions used in developing the convergence-time model of Miller and Goldberg (Miller Hi Goldberg, 19! 
are applicable to PMBGAs and the model is therefore valid for BOA and eCGA as well ( |Pelikan, Goldberg, Hi Cantu 
An approximate form of the convergence-time model for noisy function evaluations can be written 



as (Goldberg, 2002): 



Vm ■ k 



i + ^f. 



(13) 



where c± is a problem dependent constant. A detailed derivation of the above equation and other 
approximations are given elsewhere flSastry & Goldberg, 20021 |Sastry, 20"0T| ). 

Again the effect of fitness inheritance on the convergence time can be predicted by dividing the 
above equation by the convergence time when no inheritance is used. Recognizing that ajf = PiO~p 
the convergence-time ratio, t c /t c (pi = 0) is given by 



tc. 



tc (Pi = 0) 



y/l+Pi- 



(14) 



5.3 Speed-Up 



We now use the convergence-time and population-sizing models in the previous sections to predict 
the number of function evaluations required for eCGA success. The total number of function 
evaluations can be written as 

n fe =n + n(t c -l)(l-pi). (15) 

Recall that all the individuals in the initial population are evaluated and there after on an average 
n(l — pi) individuals are evaluated. 
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To isolate the effect of fitness inheritance on the scalability of eCGA, we consider the ratio of 
total number of function evaluations required with fitness inheritance and that required without 
fitness inheritance. That is, we consider the function-evaluation ratio, 



rife _ n + n(t c - 1) (1 - pi 



rife(Pi = 0) n(pi = 0)t c (pi = 0) 



*c,r (1 - Pi) + ''' 



"'• 'rril -ftl+77 ^7 , (16) 

tc{Pi = 0). 

rif e ,r ~ n r ■ t c>r (1 - Pi) . (17) 
Substituting Equations ^] and El i n the above equation and simplifying, we get 

n fe>r « (1 + pi) 1 ' 5 (1 - Pi ) (18) 

The speed-up that can be obtained through fitness inheritance is given by the inverse of the 
function-evaluation ratio: 

1 / \ 

Vinh = — L5 r • (19) 

{I+Pi) {I -Pi) 

Equation ^] indicates that the function-evaluation ratio increases (or the speed-up reduces) at 
low pi values, reaches a maximum at pi = 0.2. When pi = 0.2 the number of function evaluations 
required is 5% more than that required without inheritance. In other words, the speed-up at 
Pi = 0.2 is 0.95. For inheritance probabilities above 0.2 the function-evaluation ratio decreases 
(speed-up increases) with the inheritance probability. Equation predicts that the speed-up is 
maximum when pi = 1.0, however, it should be noted that the models derived are not entirely valid 
for higher pi values {pi > 0.95). Empirically we observed that even when pi = 0.95, we obtained 
a speed-up of 1.8-2.0. However, when pi = 1.0, the number of function evaluations required were 
four times than that required without inheritance (speed-up of 0.25). The optimal inheritance 
probability therefore should be around the value where the model prediction deviates. 

The models developed in these sections are empirically verified with the help of two test functions 
in the next section. 



5.4 Model Verification 

Before discussing the empirical verification of the models developed in the previous sections, we 
briefly describe the two test functions used for the model verification. Our approach in verifying 
the models and observing if fitness inheritance yields speed-up is to consider bounding adversarial 
problems that exploit one or more dimensions of problem difficulty (Goldberg, 2002). Particularly, 



we are interested in problems where building-block identification is critical for the GA success. 
Additionally, the problem solver (eCGA) should not have any knowledge of the BB structure of 
the problem, but should be known to researchers for verification. 

The two test functions with the above properties and used in this study are: 

1. OneMax problem, which is a GA-easy problem and in which each variable is independent 
of the others. While the optimization of the OneMax problem is easy, the probabilistic models 
built by eCGA (or PMBGAs) for OneMax, however, are known to be only partially correct 
and include spurious linkages flSastry fc Goldberg, 20001 |Pelikan, Goldberg, fc Sastry, 200ip . 



Therefore, the inheritance results on the OneMax problem will indicate if the effect of using 
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(a) Verification of population-size-ratio model 



(b) Verification of convergence-time-ratio model 



Figure 1: Verification of the population-size-ratio model (Equation 111)) and convergence-time-ratio 
model (Equation [2J for various inheritance proportions with empirical results for 100-bit OneMax 
and 10 4-Trap problems. The population size is determined by a bisection method such that 
the failure probability averaged over 30-100 independent runs is 1/m (that is, a = 1/m). The 
convergence-time is determined by the number of generations required to achieve convergence on 
m — 1 out of m BBs correctly. The results are averaged over 30 independent bisection runs. 



partially correct linkage mapping on the inherited fitness is significant. We use a 100-bit 
OneMax problem for verifying the convergence-time and population-sizing models. 

m k-Deceptive trap problem, which consists of additively separable deceptive functions 
( Goldberg, 19871 |Deb Goldberg, 1993| |Deb fc Goldberg, 1994) . Deceptive functions are 



designed to thwart the very mechanism of selectorecombinative search by punishing any 
localized hillclimbing and requiring mixing of whole building blocks at or above the order of 
deception. Using such adversarially designed functions is a stiff test — in some sense the stiffest 
test — of algorithm performance. The idea is that if an algorithm can beat an adversarially 
designed test function, it can solve other problems that are equally hard or easier than the 
adversary. 

We use a tournament selection with tournament sizes of 4 and 8 in obtaining the empirical 
results. An eCGA run is terminated when all the individuals in the population converge to the 
same fitness value. The average number of BBs correctly converged are computed over 30-100 
independent runs. The minimum population size required such that m — 1 BBs converge to the 
correct value is determined by a bisection method flSastry, 2001 ). The results of population-size and 



convergence-time ratio is averaged over 30 such bisection runs, while the results for the function- 
evaluation ratio is averaged over 900-3000 independent runs. 

The population-size-ratio model (Equation I12|) is verified with empirical results for OneMax 



and m k-Trap in Figure 1(a) The standard deviation for the empirical runs are very small (a E 
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(a) Verification of function-evaluation-ratio model 



(b) Verification of speed-up model 



Figure 2: Verification of the function-evaluation-ratio model (Equation I18|) and the speed-up model 
(Eauation ll9|) with empirical results on 100-bit OneMax and 10 4-Trap problems. The total number 
of function evaluations is determined such that the failure probability of an eCGA run is at most 
1/m. The results are averaged over 900-3000 independent runs. 



[4 x 10 , 1.8x ]), and therefore the error bars are not shown in Figure 1(a) As shown in the 



figure, the empirical results agrees with the model. The population size required to ensure that, 
on an average, eCGA fails to converge on at most one out of m BBs, increases linearly with 
the inheritance probability, pi. The population sizes required at very high inheritance-probability 
values, pi > 0.85 deviates from the predicted values. This is because the noise introduced due to 
inheritance increases significantly at higher pi values because of limited number of individuals with 
evaluated fitness that take part in the estimate of schemata fitnesses. 

The verification of the convergence-time-ratio model (Equation El with empirical results for 



OneMax and m k-Trap are shown in Figure 1(b) The standard deviation for the empirical runs are 



very small (er G [2 x 10 , 2.7 x 10 ]), and therefore the error bars are not shown. As shown in the 
figure, the agreement between the empirical results and the model are slightly poor when compared 
to that for population-size ratio. This is because of the approximations used in deriving the 
convergence time model and more accurate, but complex, models exist that improve the predictions 
(Sastry, 2001). However, as we will see later, this disagreement between the model and experiments 
does not significantly affect the prediction of speed-up, which is the key objective. The empirical 
convergence-time ratio deviates from the predicted value at slightly lower inheritance probabilities, 
Pi > 0.75 than the population-size ratio. This is to be expected as the population sizing is largely 
dictated by the fitness and noise variances in the initial few generations, while the convergence 
time is dictated by the fitness and noise variances over the GA run. Therefore, the effect of high 
Pi values, or less number of evaluated individuals, is cumulative over time and leads to deviation 
from theory at lower pi values than the population size. 
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The predicted values of function-evaluation-ratio ( Equation ITSjl and the speed-up (Equation IT§|) 
are verified with empirical results for OneMax and m k-Trap in Figures |2 (a) | and 2(b) respectively. 



The standard deviation for the empirical runs are very small (a G [7 x 10~ h ,7 x 10" 3 ]), and 



therefore are not shown. As shown in Figures 2(a) and 2(b) the empirical results agree with 



the analytical models. Furthermore, the agreement for the OneMax problem with the models 
is good even though the building-block identification for the OneMax problem is only partially 
correct. The results show that the required number of function evaluations is almost of halved 
with the use of fitness inheritance thereby leading to a speed-up of 1.75-2.25. This is a significant 
improvement over a speed-up of 1.3 observed for simple GAs with a simple inheritance mechanism 
( Sastry, Goldberg, &: Pelikan, 2001). Furthermore, fitness inheritance yields speed-up even when 



the inheritance probability is very high (as high as 0.95), which agrees with the empirical observation 
of Smith, Dike, and Stegmann (Smith, Dike, h Stcgmann, 1995). 



Overall, the results suggest that significant efficiency enhancement can be achieved through an 
inheritance mechanism that incorporates knowledge of important sub-solutions of a problem and 
their partial fitnesses. 

6 Future Work 

In this paper, we have introduced and analyzed a fitness inheritance mechanism that incorporates 
the knowledge of building-block structure and fitness. The BB structure and fitnesses are identified 
and estimated with the help of extended compact genetic algorithm, a competent GA. The results 
show that significant speed-up can be obtained through fitness inheritance and warrants further 
research and enhancements in many avenues some of which are listed in the following: 

• Problems with overlapping building blocks: While this paper considered problems with 
non-overlapping building blocks, many problems have different building blocks that share 
common components. While considering problems with overlapping building blocks, eCGA 
might not be the appropriate search method, however the fitness inheritance mechanism 
should still be valid which can be used in other more advanced PMBGAs such as the Bayesian 
optimization algorithm flPelikan &: Sastry, 2004 1. 



Non-Uniformly scaled problems: In this paper, all the building blocks of a problem had 
equal salience, which might not be the case in real-world problems. The non-uniform scaling 
induces sequentiality in the identification and convergence of the building blocks. The effect 
of non-uniform building-block salience on the speed-up and optimal inheritance proportion 
should be investigated. 

Hierarchical problems: One of the important class of nearly decomposable problems is hi- 
erarchical problems ( |Pelikan Hi Goldberg, 200T I , in which the building-block interactions are 



present at more than a single level. Such problems can be successfully solved in polynomial 
time by the hierarchical Bayesian optimization algorithm . The fitness inheritance mecha- 
nism used in this study could be enhanced and incorporated into hBOA and the efficiency 
enhancement provided by inheritance can be investigated. 

Additional dimensions of problem difficulty: In this paper we considered one of the 
dimensions of GA problem difficulty, deception. However, there are other dimensions of prob- 



lem difficulty ( Goldberg, 2002 ) such as epistasis and external noise. This factors should be 
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included in isolation or in conjunction with other factors of problem difficulty in determining 
a complete picture of efficiency enhancement provided by fitness inheritance. 



Real- World problems: One of the key objectives of analyzing and developing fitness- 
inheritance mechanism is to aid the principled design of such a mechanism in competent 
genetic algorithms for successfully solving complex real-world problems in practical time. 

Interactive and Human-Based evolutionary algorithms: In interactive evolutionary 
algorithms the fitness of a solution is given by a human being rather than by a computation 
(Takagi, 2001). In human-based GAs, both fitness evaluations and genetic operations such as 



selection, crossover, and mutation are performed by users (Kosorukoff & Goldberg, "20021 . To 



avoid overwhelming the users, it is often infeasible to use large populations even if the search 
problem might warrant one. Fitness inheritance can be used in such cases to alleviate this 
restriction. While only a small portion of the population gets the fitness from the users, the 
rest can receive inherited fitness, thereby allowing larger population sizes but still maintaining 
the same number of function evaluations from the users. 



7 Conclusions 

In this paper, we introduced a fitness inheritance mechanism that estimates the fitness by schema 
fitness. The sub-solutions (or schemata) are automatically and adaptively identified by a proba- 
bilistic model building genetic algorithm called extended compact genetic algorithm. We have also 
developed a theoretical basis for fitness inheritance and have derived models for convergence time 
and population sizing. The convergence-time and population-sizing models were in turn used to 
predict the effect of fitness inheritance on the scalability of eCGA and also to predict the speed-up 
obtained via fitness inheritance. We observed that the fitness inheritance mechanism that incor- 
porates information of building-block structure and fitness provides a significant speed-up. For 
additively separable problems, the results show that using evaluations for only about 5-15% of the 
population, fitness inheritance can yield a speed-up of around 1.75-2.25. 
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